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ABSTRACT 

Previous  detection  studies  assumed  that  a  "target"  was  either 
present  in  a  region  of  interest  with  a  probability  p,  or  not,  with  a 
probability  (l-p).  Knowing  the  value  of  p,  the  "searcher”  makes  a 
measurement  of  a  random  variable  that  has  a  probability  density  function 
which  depends  on  whether  or  not  the  target  is  present  and  attempts  to 
make  a  decision  regarding  the  presence  of  the  target.  In  this  paper 
a  more  general  point  of  view  is  adopted  in  that  we  allow  the 
possibility  of  both  target  and  non-target  elements  to  be  present  in 
any  region  of  measurement.  This  provides  an  explicit  consideration  of 
a  third  possibility,  that  of  a  mixture  of  target  and  non-target 
elements . 

The  forms  of  the  probability  densities  for  the  "mix"  variate  are 
obtained  for  a  variety  of  assumptions  regarding  the  relationships 
among  the  variates  involved.  A  sequential  decision  problem  is 
extended  to  include  the  three  value  case. 
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INTRODUCTION 


During  the  last  two  decades ,  a  well  developed  theory  of  detection 
has  been  established.  Detection  theory  is  an  adaptation  of  the 
statistical  theory  of  hypothesis  testing  to  problems  arising  in  pattern 
recognition  and  radar,  communication,  and  control  engineering.  A 
pattern  recognition  device  is  said  to  consist  of  two  parts:  a  receptor, 
which  generates  a  set  of  measurements  of  the  physical  sample  to  be 
recognized,  and  a  categorizer,  which  assigns  each  set  of  measurements 
to  one  of  a  finite  number  of  classes. 

The  purpose  of  the  present  paper  is  to  discuss  a  new  problem  in 
detection  theory.  This  problem  arises,  among  other  ways,  when  one 
attempts  to  detect  a  pattern  in  a  two-dimensional  optical  display 
(hereafter  called  picture).  The  picture  is  to  be  scanned  using  a 
receptor  of  a  fixed  size  and  shape.  For  each  positioning  of  the 
receptor,  one  obtains  a  set  of  measurements  from  the  Msubpicturelf  being 
sensed.  Each  measurement,  x,  can  be  thought  of  as  a  random  variable, 
and  is  sometimes  called  a  local  property  of  the  picture.  For  pattern 
recognition  problems  a  discriminator  (human  or  machine)  may  be  assigned 
the  problem  of  determining  whether  x  is  a  sample  from  one  or  another 
class  of  inputs,  say  a  "target  scene"  or  perhaps  a  "non-target  scene". 
The  term  "target  scene"  (or  "target")  is  defined  to  be  an  object  of 
primary  interest  to  a  decision-maker. 

Previous  detection  studies  [l],  [2],  [3],  assumed  that  a  target 
was  either  present  in  a  region  of  interest  with  a  probability,  p,  or 
not,  with  a  probability  1-p.  Knowing  the  value  of  p,  the  searcher  makes 
a  measurement  of  a  random  variable  that  has  a  probability  density 
function  which  depends  on  whether  or  not  the  target  is  present,  and 
attempts  to  make  a  decision  regarding  the  presence  of  the  target.  In 
this  paper  a  more  general  point  of  view  is  adopted  in  that  we  allow  the 
possibility  of  both  target  and  non-target  elements  to  be  present  in  any 
subpicture.  We  shall  therefore  be  concerned  with  classifying  a 
measurement  as  being  a  response  from  one  of  three  sources,  namely, 

* 

References  in  brackets  may  be  found  on  page  27. 
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1.  a  subpicture  containing  target  elements  only, 

2.  a  subpicture  containing  non-target  elements  only, 

3-  a  subpicture  containing  both  target  and  non-target 

elements . 

The  same  problem  arises  [4]  in  certain  analyses  of  radar 
search  systems.  The  procedure  is  to  continually  sample 
a  given  range  "bin"  until  a  yes-no  decision  concerning 
the  presence  of  a  target  is  made.  One  often  wishes  to 
provide  for  the  possibility  of  a  target  emerging  into  the 
bin  during  the  sampling  process. 

The  three  categories  of  subpictures  will  be  called  pattern  classes, 
and  denoted  by  R^,  R^,  and  R  ,  respectively.  To  treat  the  problem  at 
hand  let  p(x|R^)  be  the  probability  of  occurrence  of  the  measurement, 
x,  given  that  it  belongs  to  class  R^.  We  shall  assume  that  p(x|R^)  and 
p(x|R0)  are  known  and  set,  for  our  first  task,  the  determination  of 
p(x|Rj).  To  facilitate  the  following  discussion,  we  shall  denote  the 
measurement  x  by  r,  s,  or  u  depending  on  whether  x  is  a  measurement 
from  Rn  ,  R0,  or  R_,,  respectively.  Furthermore  we  let  f^  (r)  =  p(r|R^), 
f^(s)  =  p(s|R^),  and  h(u)  =  p(u|R  ).  The  next  section  shall  be  devoted 
to  determining  h(u)  under  various  assumptions  about  the  form  of  u. 


V. 


The  first  three  expressions  result  when  one  conceives  of  x  as 

some  average  value  taken  over  the  subpicture.  In  picture  processing  x 

could  be  an  average  value  of  some  textural  property,  say  average 

intensity.  One  can  think  of  the  subpicture  S  as  partitioned  into  two 

parts ,  P-j  and  Pp,  where  P^  contains  target  elements  only  and  Pp 

contains  non-target  elements  only.  Let  and  be  vector  parameters 

which  somehow  characterize  the  geometric  shapes  of  P^  and  Pp.  Let 

area  (P^)  1 

v(=  - r  \  )  be  the  fraction  of  the  subpicture  containing  target 

area  v  o  y 

elements  only.  The  first  equation  expresses  u  as  a  weighted  average 
of  r  and  s.  The  measurement  r  is  taken  over  P^;  the  measurement  s  is 
taken  over  P  .  In  general  the  measurement  r  will  be  a  function  of 
both  w  and  S^.  Similarly  s  is  a  function  of  w  and  S^.  As  P^  is  the 
complement  of  P  with  respect  to  S,  a  specification  of  also  yields 
Sg*  Therefore  we  can  rewrite  expressions  I  and  IV  in  the  form 

I’  u  =  wr(w,S^)  +  (l-w)s(w,S  )  , 

IV’  u  =  r(w,S1)  +  s(w,S1)  . 

In  expression  II ;  r  and  s  are  deemed  to  be  independent  of  and 
S^.  In  expression  III,  r  and  s  are  assumed  to  be  independent  of  w  and 
S^.  Expressions  IV  and  V  arise  naturally  when  one  considers  the 
measurement  x  to  be  a  count  of  the  number  of  occurrences  of  a  certain 
event  in  a  subpicture  (such  as  the  number  of  edges,  or  perhaps,  thc- 
number  of  closed  contours).  The  order  in  which  we  shall  discuss  the 
expressions  will  be  II,  III,  V,  I’ ,  and  IVT  . 

1.2  Case  1.  u  =  wr(w)  +  (l-w)s(w). 

In  this  case  we  have  assumed  that  r  and  s  are  independent  of 
and  S^.  The  quantities  r,  s,  and  w  are  random  variables  with  densities 
f_1(r),  fp(s),  and  k(w)  respectively.  Our  objective  is  to  derive  an 
expression  for  the  density  function,  h(u) ,  of  the  variate  u.  The 
variate  u  is  a  monotonic  function  of  s  when  r(w)  and  w  are  held  constant. 
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To  obtain  the  distribution  of  u  we  imagine  that  the  two  variates  r(w) 
and  w  are  held  fixed  at  some  arbitrary  value  while  s  varies  over  its 
possible  range  according  to  its  distribution.  The  probability  distri¬ 
bution  of  u  which  then  results  is  the  conditional  distribution  of  u 
given  r(w)  and  w.  Moreover,  u  becomes  a  mon  otonic  function  of  the 
variable  s  although  the  equation  relating  u  and  s  will  contain  the 
parameters  w  and  r(w).  Hence  the  conditional  density  9(u|w,  r(w))  can 
be  derived  from  fg(s|w)  by  a  univariate  method  giving 


cp(u|w,  r(w) )  =  ^  f2 


M 


w)  . 


Furthermore  one  can  then  produce  the  joint  density  function,  g(u,r(w)|w), 
of  u  and  r(w)  conditioned  upon  w  by  the  expression 


g(u,r (w)  | w)  =  fi(r(w)|w)  fg(U~^V^- 


v) 


One  obtains  the  density  of  u  (conditioned  on  w)  by  integrating  the 
function  g(u,r(w)jw)  with  respect  to  r(w),  taking  pains  to  keep  the 
argument  of  the  function  f^  in  the  range  of  the  variable  s.  One 
obtains 


h(u|w)  =  — 


J  f-L(r(w)|w) 

-T(u,w) 


-wr  (w) 


-w 


w)  dr 


In  the  above,  the  set  T(u,w)  =  |V(w)|r(w)  e  R(w)  and  e  S(w)j"  , 


where  R(w)  is  the  range  of  r(w)  and  S(w)  is  the  range  of  s(w).  If  we 
let  R(w)  =  [c(w),  d(w) ] ,  and  S(w)  =  [a(w),  b(w)],  then 


T(u,w)  =  <fr|c(w)  ^  r  ^  d(w)  and  a(w)  ^  ^  ^  b(w)|  . 


/  \  u— wr  /  \ 

From  the  inequalities  a(w)  ^  ^  b(wj,  we  obtain 

(l-w)a(w)  +  wr(w)  ^  u  ^  (l-w)b(w)  +  wr(w) .  From  Figure  1  we  can 
visualize  the  dependence  of  the  limits  of  integration  upon  u  and  w. 
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FIGURE  1 


Hence 


U(u,w) 


h(u|w)  = 


1-w 


f  (r(w)|w)  f  ( 


u 


-wr(w) 


L(u;w) 


2'  1-w 


w)  dr, 


with 


and 


L  =  max 


£  u  -  b(w))  > 


U  =  min  (—  u  -  a(w);  d(w)) 

vw  w 


There  are  two  cases  to  distinguish.  If 
(l-w)a(w)  +  wd(w)  ^  (l-w)b(w)  +  vc(v),  then  T(u,v)  is 

[c(w),  (l-v)a(w)  +  u]  for  (l-w)a(w)  +  wc(w)  £  u  ^  (l-w)a(w)  +  wd(w) 

■  [c(w),  d(w)]  for  (l-w)a(w)  +  wd(w)  S  u  ^  (l-w)b(w)  +  wc(w) 

[ (l-w)b(w)  +  wu,  d(w) ]  for  (l-w)b(w)  +  wc(w)  ^  u  £  (l-w)b(w)  +  wd(w). 

If  (l-w)a(w)  +  wd(w)  s  (l-w)b(w)  +  wc(v),  then  T(uJ)w)  is 

[c(w).  (l-w)a(w)  +  wul  for  (l-w)a(w)  +  wcfw)  <  u  ^  (l-w)b(w)  +  we(w) 

(l-w)b(w)  +  vru,  (l-w)a(w)  +  wu]  for  (l-w)b(w)  +  wc(w)  <  u  S  (l-w)a(w)  +  wd(w) 

[ (l-w)b(w)  +  wu,  d(w) ]  for  (l-w)a(w)  +  wd(w)  ^  u  ^  (l-w)b(w)  +  wd(w). 
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Finally;  to  determine  h(u)  we  must  integrate  with  respect  to  w;  the 
result  is 

h(u)  =  J  k(w)h(u|w)  dw 
W 

1.3  Case  2.  u  =  wr  +  (l-w)s. 

In  this  case  we  have  assumed  that  r  and  s  are  independent  of  w  and 
the  shape  vectors  S.^  and  S^.  As  in  Section  1.2  the  objective  is  to 
determine  the  density  function,  h(u) ,  for  the  variate  u.  Before 
proceeding  to  the  general  formulation  we  shall  consider  several  special 
cases. 

1.  Suppose 

a.  r  and  s  are  constants, 

hs  w  is  uniformly  distributed  on  [o,m];  0  <  m  ^  1, 
c .  r  >  s . 

Then  u  is  uniformly  distributed  on  [s,  s  +  m(r-s]. 

2.  Suppose 

a.  r  is  a  constant, 

b.  s  and  w  are  statistically  independent, 

c.  s  has  a  uniform  distribution,  f^s),  on  [a,b], 

d.  w  is  uniformly  distributed  on  [0,l]. 

We  obtain,  using  rationale  similar  to  that  of  Section  1.2, 


h(u) 


b-a 
T(u) 


f 

2V  1-w 


)  dw 


> 


or 


(b-a)h(u)  =  J  ^  dw 
T(u) 


> 


where 
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'(u)  =  { 


w  0  —  w  ^  1  and  a  ^ 


^  b}  . 


We  jonsider  three  subcases: 


r  <  a.  Then 


T(u)  = 


|”  a-u  b-u~l 


for  r  <  u  <  a 


i.a-r  ’  b-rJ 


0  ,7—^  for  a  <  u  <  b 

'  b-rJ 


(b-a)h(u)  = 


for  r  ^  u  <  a 


L  in(  — — )  for  a  <  u  <  b 


The  graph  of  this  function  has  the  form 

.  .  ,  A 

(b-a)h(uj 


a  <  r  <  b.  Then 


b  u 


f  <°  - 


for  a  ^  u  <  r 


for  r  <  u  ^  b 


-)  for  a  <  u  <  r 


(b-a)h(u) 


in(^) 

u-r 


iUI  U  =  L 


ror  r  <  u  <  b 


whose  graph  has  the  form 
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iii.  r  >  b.  Then 

T(u)  =  [a,  min(u,b)]  , 


and 


M— ) 

r-u 


for  a  ^  u  <  b 


(b-a)h(u)  =< 

I  in(^— for  b  ^  u  <  r 
The  graph  of  (b-a)h(u)  has  the  form 


In  deriving  the  general  form  for  h(u) ,  we  choose  to  fix  the 
variables  r  and  s  at  constant  values  initially.  Again  we  see  that  u 
behaves  as  a  monotonic  function  of  w  when  r  and  s  are  held  constant. 
Therefore  the  conditional  density  i(u|r,s)  can  be  derived  from  the 
probability  density  of  w ,  k(w) ,  by  a  univariate  method,  yielding 

«u|r,s)  =  k(j5f)  |^|  • 

Moreover  one  can  derive  the  joint  density  function  g(u,r,s)  from  the 
expression 


lh 


g(u,r,s)  =  f(r,s)i(u|r,s) 


where  f(r,s)  is  the  joint  density  function  of  r  and  s.  Finally  the 
density  of  u  is  obtained  by  integrating  g(u,r,s)  with  respect  to  r  and 
s.  Thus  we  have 

h(u)  =  J  J  f (r,s)  k(^f)  |^|  dr  ds  (1.5.1) 

T(u) 

where 

T (u)  =  {(r,s)|r  e  R,  s  e  S,  and  e  wj  , 


and 


R  =  range  of  r  , 
S  =  range  of  s  , 

W  =  range  of  w. 


Letting  R  =  [c,d],  S 
and  w  are  independent,  we 


=  [a,b],  W  =  [0,1],  and-  assuming  that  r,s, 
obtain  the  expression 


where 

p(u) 
P'  (u) 


h(u)  =  r 


d  p(u) 


7(u) 


f1(r)f2(s)k(~)  (^)  ds  dr  + 


rP'(u) 


r  r  ,  ,u-sv  ,  i  s  .  . 

J  ,  J  dr  ds  , 

y'  (u)  c 


max  {a,  min(u,  b)}  5  7(u)  =  min  [d,  max(c,u)}  , 
max  {c,  min(u,b)}  ,  y' (u)  =  min  {b,  max(u,a)}  . 


(1.3.2) 


The  two  integrals  evolve  as  a  result  of  partitioning  T(u) 


sets  T-^(u)  and  Tg(u) ,  where 

Tx(u)  =  j 

jb,s)  jr  e  R,  s  e  S,  r  >  s,  and  0  s  <,  lj 

and 

T2(u)  =  j 

[(r,s)|r  e  R,  s  e  S,  r  <  s,  and  0  <  ~  ^  lj 

into  two 
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or  equivalently, 

T^(u)  =  {(r,s)|r  e  R,  s  e  S,  and  s  ^  u  <  r]  , 

and 

T^(u)  =  {(r,s)|r  e  R,  s  e  S,  and  r  ^  u  ^  s}  • 

1.4  Numerical  Example  for  Case  2. 

Suppose 

a.  r,  s,  and  w  are  statistically  independent,  uniformly 
distributed  random  variables, 

b.  R  =  [c,d],  S  =  [a,b],  and  W  =  [0,l]  , 

c.  for  definiteness,  0<c<a<d<b  . 

Then 

(d-c)  (b-a)h(u)  =  I-^u)  +  I2(u)  , 

where 

f  0  for  u  =£  a 


Ix(u)  =  ■ 

(d-a)in(d-a)  +  (7(u)  -  3(u))in(/(u)  -  3(u)) 
-(d-P(u))in(d-0(u))  -  (7(u)-a)in(7(u)-a)  for  u>  a 

and 

I2(u)  =  (b-c)in(b-c)  +  (7'(u)  -  0 ' (u) )in(7' (u)  -  P'(u)) 

-  (b-01 (u))in(b-0’ (u))  -  (7' (u)-c)in(7’ (u)-c)  . 


Furthermore 

(d-c) (b-a)h(u) 


+  h-^(u)  for 

k0  -  hp(u)  for 

k,  +  h, (u)  for 

3  3 


c  ^  u  <  a 

a  <  u  ^  d  , 

d  <  u  £  b 


where , 
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and 


k^  =  (b-c)in(b-c)  -  (a-c)in(a-c) , 

k^  =  (d-a)in(d-a)  +  (b-c)in(b-c) , 

k^  =  (b-c)in(b-c)  -  (b-d)in(b-d) , 

h  (u)  =  (a-u)in(a-u)  -  (b-u)in(b-u) , 

h,_,(u)  =  (d-u)in(d-u)  +  (u-a)in(u-a)  +  (b-u)in(b-u) 

+  (u-c)in(u-c) , 


h^(u)  =  (u-d)in(u-d)  -  (u-c) in(u-c) .  b 

A  very  lengthy  and  tedious  computation  verifies  that  J  h(u)  du  =  1. 

c 

Ix(u)  +  I2(u) 

Let  c=l,  a=2,  d=3,  and  b=4.  Then  h(u)  =  - r -  ,  and 

k-j^  =  k2  =  k,  =3  in  3. 

The  graph  of  h(u)  is  given  in  the  following  figure. 


We  shall  make  use  of  this  example  again  in  Section  2.1. 


1.5  Plausible  Density  Functions  for  the  variable,  w. 


In  this  section  we  shall  derive  several  plausible  density 
functions,  k(w),  for  the  random  variable  w.  We  shall  describe  the 
circumstances  which  give  rise  to  these  functions  and  discuss  the 
importance  of  obtaining  a  catalog  of  such  functions. 

Consider  the  problem  of  scanning,  horizontally  a  two-dimensional 
optical  display,  such  as  a  photograph  or  a  television  image  with  a 
receptor  (window). 


IT 


Suppose , 


a.  the  target  in  the  display  is  rectangular -shaped 

b.  the  window  is  square -shaped 

c.  the  edges  of  both  the  window  and  target  are  parallel  to 
the  corresponding  edges  of  the  optical  display  as  illustrated, 


d.  each  possible  position  of  the  window  is  equally  likely. 

Let  the  window  be  a  unit  square  and  let  the  target  be  S(>  l)  units 

long  and  T(>  l)  units  wide.  Let  w  be  the  ratio, 
area. of  overlap  of  target  and  window  .  Wg  wish  tQ  find  thfi 
area  of  window 

probability  density  function,  k(w),  given  there  exists  an  overlap  of 
window  and  target.  Assuming  that  each  admissible  position  of  the 
window  with  respect  t6  the  target  is  equally  likely  we  can  derive 


the  expression 


k(w)  = 


S  +  T  -  2  -  2  In  v 


;  0  <  w  £  1. 


For  S=^T ,  we  obtain, 


k(w)  = 


S  -  1  -  in  w 


;  0  <  w  ^  I. 


As  either  S  or  T  approaches  infinity,  k(w)  approaches  a  uniform 
distribution  on  [0,l]. 
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Suppose 


a.  the  target  and  window  are  circular-shaped  with  radii  of 
R(>  l)  units  and  one  unit,  respectively, 

b.  each  possible  position  of  the  window  is  equally  likely, 

c.  w(s)  defines  the  ratio, 

area  of  overlap  of  the  target  and  window 
area  of  window 


Then 


w 


(s)  = 


Vs)  +  S2<s> 

TT 


where 


S1(s)  =  \ 


TT 

2 


y(s)Vl-y2(s)  +  Sin  1  y(s)  for  0  ^  y(s)  ^  1 


+  _ly(s)lVl-y"(s)  +  Sin  J’|y(s)  [j  for  -1  s  y(s)  <  0 


TTR 


S2(s)  =  ^ - [(t(s)  -  y ( s)  )Vr2  -  (t(s)  -  y(s))2  +  R2Sin  1(t^d  ~-yb)-) 


and 


t(s)  =  R  -  1  +  s  , 


v(s)  =  s  r  2^R--IUs--Ii  . 
7K  '  2(s  +  R-l) 


The  variable  s  is  a  random  variable  having  a  density  function 

*<B>  - lui$rJL  0SsS2- 


Estimates  of  the  probability  density  function  for  w  may  be  obtained 
by  computer  simulation.  One  could  sample  s ,  many  times,  from  a 
discrete  version  of  its  cumulative  distribution,  evaluate  w(s),  and 
construct  a  discrete  approximation  to  k(w).  The  procedure  could  be 
repeated  for  a  family  of  R  values. 

For  the  discussion  above,  we  selected  two  examples  where  the 
variate  w  could  range  in  the  interval  [0,l].  Another  consideration 
is  motivated  by  Figures  la«  and  lb. 

19 


la.  lb. 


In  Figure  la  the  target  is  smaller  than  the  window.  In  Figure 
lb  we  see  an  elongated  target,  so  that  the  target  never  completely 
fills  the  window.  These  examples  show  that  the  values  of  w  might  be 
restricted  to  an  interval,  say,  [0,m],  where  m  <  1. 


The  computations  of  density  functions  for  w  for  such  cases, 
and  the  implications  with  regard  to  Equation  1.5*1  will  be  given  in 
another  paper. 

Our  purpose  is  to  provide  a  catalog  of  plausible  density  functions 
for  the  variate  w.  A  function  from  the  catalog  could  prove  to  be 
useful  as  an  approximation  in  an  application  where  a  target  shape  is 
not  regular.  Moreover,  knowledge  of  the  properties  of  the  functions 
might  dictate  the  design  of  receptor  shapes. 

1.6  Use  of  Histogram  Data  in  Conjunction  with  Equation  (1.3-2) . 


In  many  pattern  recognition  problems,  probability  density 
functions  for  r  and  s  are  approximated  by  histograms  obtained  from 
sample  sets,  frequently  called  training  sets. 


With  this  in  mind,  let  us  define, 

m 

fx(r)  = 

i=l 


and 


f2(s)  =  l  bjHU-s.)  , 

j=l 


where 


(1.6.1) 


(1.6.2) 
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H(x)  = 


'  1  ;  x  >  0 

0  ;  X  s  0 

We  shall  determine  the  function  h(u)  with  f^r)  and-  defined  "b y 

equations  ( 1.6.1)  and  (1.6.2)  and  a  uniform  density  function  for  w. 
It  follows  from  equation  (l.3*2)  that 

m  n 

h(u)  =  y  y  ajD.  [^(ijjju)  +  I2(i,j,u)]  , 
i=l  j=l 


where 


and 


ana 


I2(i> j,u) 


3  (u)  =  max  [s.,  min(u,b)]  , 

J 

7^(u)  =  min  [d,  max(r^,u)]  , 

P2(u)  =  max  [r  ,  min(u,d)]  , 

7  (u)  =  min  [b,  max  (u,s.)]  . 

—  J 

For  w  uniformly  distributed  on  [0,l],  we  obtain 

I^ijjju)  =  (d-s  )^n(cl-s  )  +  (y^u)  -  P1(u)  )^n(71(u)-P1(u)) 

-  (d-P1(u))in(d-31(u))  -  (71(u)  -  sJ)in(71(u)  -  s^)  , 

and 

I_(i,j,u)  =  (b-r  )^n(b-r  )  +  (70(u)  -  Mu)  )in(yp(u)  -30(u) ) 

CL  _L  J-  CL.  C.  C.  CL 

-  (b-P2(u))in(b-P2(u))  -  (72(u)-r1)in(72(u)-ri)  . 
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1.7  Case  3*  u  =  r(w)  +  s(w)« 


Proceeding  as  in  the  earlier  sections,  we  obtain 

h(u)  =  J  k(w)  f n (r | w)  fQ(u-r(w))  dr]  dw 


W 


T '  (u.w) 


1.8  Case  b .  u  =  vr(w,S^)  +  (l-w)s(w,S^)  and  u  =  r(v,S^)  + 

The  computation  of  h(u)  for  these  expressions  seems  to  be  very- 
difficult.  One  requires,  initially,  a  characterization  of  P  ,  that 
part  of  the  subpicture  containing  target  elements  only.  For  complex 
(say  non-convex)  target  shapes  and  for  arbitrary  target  orientations 
within  the  subpicture  this  task  appears  to  be  extremely  complicated. 


1.9  Extension  to  a  Set  of  Measurements. 


Let  us  suppose  that  for  each  positioning  of  the  receptor,  one 
obtains  several  measurements,  xn,x0,...  x^ .  Let  X  =  (xn , . . . ,x^ ) .  We 
are  interested  in  determining  p(x|R^),  p(x|R^)  and  p(x|R^).  Let  us 
assume  that  for  each  of  the  classes  R^,  R^,  and  R^  the  components  of 
X  are  statistically  independent.  This  allows  us  to  write 


p(x|R.)  =p(x1|r.)  p(x0|R.)  ...  p(xJr.)  ;  1=1, 2, 3 


and  the  theory  discussed  in  sections  1.1  to  1.8  is  applicable. 


2.  CLASSIFICATIONS  OF  PATTERNS 
2.1  Bayes  Analysis. 

Here  we  use  decision  surfaces  of  our  pattern  classifier  which  are 
defined  by  a  set  of  functions  g^(x) ,  gg(x),  and  g  (x).  These  functions, 
called  discriminant  functions  are  chosen  such  that  for  all  x  in  R  , 
g, (x)  >  g  (x)  for  i,j=l,2,3,  i  ±  3- 

J 

The  patterns  in  each  of  the  three  categories  R^,  R^,  and  are 
random  variables  governed  by  the  probability  functions  p(x|R  )  =  f^(x), 
p^R^)  =  f^x),  and  p(x|R^)  =  h(x) .  An  additional  set  of  values 
which  is  needed  in  order  to  construct  the  discriminant  functions  is 
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the  set  of  a  priori  probabilities ,  fp(Fh)},  i=l,2;3*  The 
discriminant  functions  will  be  expressed  in  terms  of  the  p(x|Ri)  and 
p(R/).  Our  pattern  classifier  will  be  optimum  in  a  Bayesian  sense 
[5];  [6]  if  we  let 


3 

g  (x)  =  -  y  X (i | j)  p(xjR  )  p(R  ), 
±  - 1  J  J 

j=l 


where  \(i| j),  the  "loss  function"  represents  the  loss  incurred  when 

the  classifier  places  a  pattern  actually  belonging  to  the  class  R.  into 

J 

category  R^.  Therefore  the  pattern  classifier  makes  its  classifications 
by  the  following  steps: 


1.  the  measurement  x  is  presented  to  the  classifier. 

2.  the  classifier  computes  Max  [g-^(x),  g^(x) ,  g^(x)]  and 
decides  in  favor  of  the  category  associated  with  the  function  yielding 
the  largest  value. 


A  loss  function  is  said  to  be  symmetric  if  it  is  of  the  form 


\(i 

-1  j)  =  1 

where 

0  if 

i  ^  0 

B .  .  =J 

1J 

1  if 

i  =  j 

For  such  loss  functions,  the  discriminant  functions  turn  out  to 
be 

gi(x)  =  p(x|Ri)  p(R1)  ;  i=l,2,3. 


If  all  of  the  classes  are  equally  likely  a  priori,  i.e. 

p(Rx)  -  p(R2)  =  p(R3)  =  | 

we  need  only  compute  p(x|R^)  for  i=l,2,3  and  select  the  maximum  This 
decision  is  called  the  maximum  likelihood  decision. 
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Returning  to  the  numerical  example  in  Section  1.4,  we  shall 
determine  the  maximum  likelihood  decision  for  each  x  in  the  interval 
[1,4].  The  results  are 


class  R^ 

if 

x  e 

[1,2] 

decide  in  favor  of  < 

class  R_, 

3 

if 

x  e 

[2,3] 

taclass  R^ 

if 

x  e 

[3,4] 

2.2  Sequential  Analysis. 

The  result  of  a  measurement  for  the  subpicture  is  the  value  x 
which  has  the  probability  density  function  p(x|R^)  or  p(x|R^)  or 
p(x|r^)  depending  on  whether  the  subpicture  contains  target  elements 
only,  non-target  elements  only,  or  both  target  and  non-target  elements 
Following  each  measurement,  the  observer  makes  one  of  the  following 
four  decisions: 

D^:  decide  target  is  present. 

D^:  decide  target  is  not  present. 

decide  both  target  and  non-target  elements  are  present. 

W:  wait  for  another  measurement. 

The  decisions  D^,  D^,  and  are  terminal  decisions,  completing 
the  process.  The  losses  of  the  searcher  making  decision  R^  given  R^ 
is  true  are  X(i|j).  We  shall  assume  that  X(l|l)  =  X ( 2 | 2)  =  X ( 3 | 3 )  =  0 
The  decision  W  allows  the  process  to  continue  at  least  one  more  time 
step  (i.e.  permits  the  observer  to  make  at  least  one  more  measurement) 
The  loss  incurred  by  this  delay  will  be  assumed  to  be  dependent  on 
which  class  R.  we  are  measuring.  Let  equal  the  delay  loss  incurred 
if  R_^  is  truly  the  class  being  measured.  The  objective  of  the 
searcher  is  to  minimize  the  expected  cost  of  a  search.  The  decision 
policy  that  achieves  the  minimum  expected  cost  is  called  the  noptimaln 
policy.  Following  Pollock  [2]  we  can  write  a  functional  equation 
which  will  yield  the  optimal  policy.  The  equation  is  an  application 


2h 


of  Bellman’s  Principle  of  Optimality  [T ] •  Let  ffp-^^Pg)  Le  the 

minimum  cost  of  search  obtained  using  an  optimal  policy  where 

=  p(FL  )  is  the  a  priori  probabilities  associated  with  the  occurrence 

of  the  class  R. . 

1 


We  are  interested  in  determining  the  function  f(p^^p^)  which  is 
a  function  of  the  current  information  about  probabilities  of  the 
occurrence  of  R  and  R^.  We  are  equally  interested  in  determining 
D(p1^p2)  the  decision  that  one  should  make  as  a  function  of  p^  and  p^. 

A  more  realistic  problem  arises  when  one  considers  that  there  exist  at 
most  only  n  available  observations  remaining  before  a  terminal 
decision  MUST  be  made.  If  the  decision  W  is  made,  at  the  next  decision 
there  will  be  only  n-1  possible  observations  left.  If  n=Q,  then  one  of 
the  terminal  decisions  must  be  made.  Letting  £n(Pj_>P2)  Le  the  cost 
search  using  an  optimal  policy  given  there  are  n  available  observations 
remaining  before  a  terminal  decision  must  be  made,  we  get 

decision  D. 


fn(pl'P2^  =  min^ 


fp2X(l|2)  +  P5\(l|3) 
PxX(2|l)  +  P5X(2|3) 


P^(3|l)  +  P2X(3|2) 


decision  D, 


decision 


p1p(x|r1)  ^^(pppy  a*  ; 


i=l 


decision  W. 


For  n=0, 


P2X(l|2)  +  P3X(l|3) 
f^(pn;P0)  =  min]  Pix(2|l)  +  ( 2 1 3 ) 


If  the  decision  is< 


D, 


D, 


D, 


[Pl\(3|l)  +  P2X(3|2)  j  D, 


^  respectively,  the  expected  losses  incurred  due 


to 
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terminal  incorrect  decisions  are 


'  p2x(i|2)  +  p5x(i|3)  ’ 

.  P=lX(2|  1)  +  p5X(2|3)  . 
PlX(5|l)  +  P2X(3|2) 


respectively. 


If  the  decision  is  W,  the  expected  cost  is  p_W_.  +  p_W0  +  p_W_  plus  the 

11  d  d  55 

cost  of  continuing  from  that  point  on,  having  observed  some  value  of  x. 
The  probability  of  observing  a  value  between  x  and  x  +  dx  is 
[p1P(x|R1)  +  p^p^R^)  +  p^p(x|R^)]  dx*  Having  observed  the  value  x, 
the  probabilities 

I  N  P(XIRX)  Pp 

P1  "  p^Rib)  =  pip(x|R1)  +  PgPCxlRg)  +  p5p(x|R5)  ' 

and 

/rt  ,  N  p(x|r2)  P2 

P2  -  p(r2|x)  -  pip(x(ri)  +  p2p(x|R2)  +  PjP(x|R^) 

are  obtained  from  Bayes  rule. 


Although  the  basic  formalism  of  dynamic  programming  carries  over 
without  change  from  the  two  class  detection  problem,  we  have  introduced 
an  additional  "state"  variable  into  the  functional  equations.  This 
introduction  causes  some  further  numerical  difficulties  which  may  be 
resolved  by  any  one  of  several  techniques  recorded  in  [ T ] • 
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